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Abstract. We provide an explicit formula for the limiting free energy density (log-partition 
function divided by the number of vertices) for ferromagnetic Potts models on uniformly 
sparse graph sequences converging locally to the d-regular tree for d even, covering all 
temperature regimes. This formula coincides with the Bethe free energy functional evaluated 
at a suitable fixed point of the belief propagation recursion on the d-regular tree, the so- 
called replica symmetric solution. For uniformly random d-regular graphs we further show 
that the replica symmetric Bethe formula is an upper bound for the asymptotic free energy 
for any model with permissive interactions. 

1. Introduction 

Let G = (V, E) be a finite undirected graph, and 3£ a finite alphabet of spins. A factor 
model on G is a probability measure on the space of (spin) configurations a 6 2£ v of the 
form 

(y')e-E iev 

where if) is a symmetric function ^T 2 — > M>o, if) is a positive function — > R>o, and 
^g(^) = is the normalizing constant, called the partition function (with its logarithm 
called the free energy). The pair ifi = (if), if)) is called a specification for the factor model (1). 
We assume the specification is permissive, that is, there exists cr p e with min^ if)(o~, o~ p ) > 
0. 

A primary example we consider in this paper is the q-state Potts model on G with inverse 
temperature and magnetic field B, given by specification 

iP{a,o') = e^=°'\ 4,{o) = e B1 ^= l \ X = [q] = {1, . . . , q}. (2) 

We write Vq for the corresponding measure on [q] v . The model is said to be ferromagnetic 
if (3 > 0, and anti-ferromagnetic otherwise. 

In this paper we study the asymptotics of the free energy for factor models (1) on graph 
sequences G n = (V n , E n ) converging locally to the ci-regular tree (d > 3) in the sense of 
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Benjamini-Schramm [5] (see Defn. 1.1). This class includes in particular any sequence of 
(i-regular graphs with girth (minimal cycle length) diverging to infinity. 

The study of statistical mechanics on regular trees has a long history, initiated by Bethe [6]. 
While tree graphs do not capture the finite-dimensional structure of actual physical systems, 
models on trees are often amenable to exact analysis. Further, it is often argued that 
they are a good approximation to models on the lattice Z d for large d or for long interaction 
range [30, 1, 9, 29]. According to this expectation, models on trees provide a flexible and well- 
defined approach for investigating mean-field theory (i.e. the behavior of statistical mechanics 
models in high dimensions). 

While this expectation proves to be correct in a number of examples, it has recently 
become clear that, in many cases, models on trees fail to capture the "correct" mean- field 
behavior. Spin glasses provide an important example of this phenomenon: a fairly natural 
class of spin glasses on trees was introduced by Thouless [29] and further characterized by 
Chayes et al. [10]. However, the thermodynamic behavior observed there is very different 
from the widely accepted mean-field theory of spin glasses, as obtained from analysis of the 
Sherrington-Kirkpatrick (sk) model [24, 28]. In particular, the low-temperature phase of 
the tree models defined in [29] does not exhibit replica symmetry breaking (in contrast with 
sk) . A similar discrepancy was observed in the case of Anderson localization by Aizenman- 
Warzel [3]. 

In the case of spin glasses, Mezard-Parisi [23] argued that this difference arises because 
of a particular feature of tree graphs: in the subgraph induced by the first i levels of the 
regular tree, the leaves constitute a non- vanishing fraction of the vertices as t — > oo. They 
suggested that mean-field theory ought instead to be defined by considering graphs that 
are not themselves trees, but "look like regular trees" in the neighborhood of a typical 
vertex (which fails for the depth-£ subtree of the regular tree) - - the canonical example 
being the (uniformly) random (i-regular graph ensemble. This approach allows to reconcile 
discrepancies in several known cases. In particular, spin glasses on random regular graphs 
are expected to exhibit replica symmetry breaking with features analogous to the SK model 
(see [23] and [22, Ch. 17]). 

Let us also mention that the study of statistical mechanics models on locally tree-like 
graphs has attracted renewed interest because of the connection with random combinatorial 
problems, such as fc-SAT and graph coloring. Statistical physicists were indeed able to com- 
pute threshold locations for these models by analyzing suitable Gibbs measures on locally 
tree-like structures [25, 20, 22]. Rigorous verification of these predictions is an outstanding 
mathematical challenge. 

In this paper we consider the existence and value of the free energy density (asymptotic 
free energy per spin) 

= lim 4> n = lim -E n [logZ n ], Z n = Z Gn (±), (3) 

for G n a (possibly random) graph sequence converging locally to the regular tree and E n 
expectation over the law of G n . For Ising (specification (2) with q = 2) models in the 
ferromagnetic regime, for any graph sequence with uniformly integrable average degree con- 
verging locally to a (possibly random) tree, the free energy density (3) exists and depends 
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only on the limiting tree [15, 17, 16]. The computation of <fi allows to compute various limits 
of interest with respect to the v Gn , as done for example in [26, 17]. Proving existence of the 
free energy density for q > 2 and general specification ^ poses several challenges: 1 

1. There are examples in which the free energy density (3) depends not only on the limiting 
tree but also on the particular graph sequence. For example, in the anti-ferromagnetic Ising 
model at sufficiently low temperature (sufficiently negative (3), it is not difficult to show that 
the free energy per spin on random <i-regular graphs is asymptotically lower than on random 
bipartite <i-regular graphs. As a consequence local weak convergence is not in full generality 
a sufficient condition for existence of the limit (3). 

2. Statistical physicists have put forth a number of conjectures (corresponding to different 
models or regimes) on the free energy density (3) (see e.g. [24, 22]). This analysis generally 
imposes a probability distribution on the graph G n which is suitable for calculations, typically 
the Erdos-Renyi or configuration models. Ensuing rigorous work has also focused on the same 
random graph ensembles (see e.g. [28]) rather than understanding which graph sequences in 
general have a limit (3). In this paper we focus instead on individual graph sequences. 

Characterizing the limit for ensembles of uniformly random graphs is already beyond 
current techniques for many factor models (1). Achieving the same goal for general locally 
tree-like graph sequences is all the more difficult, and requires to go beyond what is known 
from physics methods. A simple example is provided again by the anti-ferromagnetic Ising 
model: existence of the limit can be proved by a combinatorial interpolation [4], but even a 
heuristic prediction of the value is unavailable. 

In contrast, as mentioned above the free energy density for the ferromagnetic Ising model 
on locally tree-like graphs exists and can be computed. Its value is given by the Bethe 
prediction <3>, which is expressed in terms of a certain fixed point of a distributional recursion 
(given in (6) in the d- regular setting; see [14, 16] for the general case). This result was proved 
in the case of Galton- Watson limiting trees in [15] via an interpolation scheme. In [16] a 
generalized scheme was developed which gave the result for Ising on general limiting trees. 
The method was applied also to show liminf n n > $ in the ferromagnetic g-Potts model 
with q > 2, but could only pin down <fi = $ in limited regimes of ((3,B). The difficulty of 
the Potts model with q > 2 may be understood as follows: by a monotonicity argument, 
the local weak limit of Potts measures on G n is sandwiched between the free and maximally 
1-biased Gibbs measures on the limiting tree. When q = 2 these measures coincide for 
any > 0, .B > 0, 2 but when q > 2 the measures disagree in certain regimes of ((3,B). 
This corresponds to the appearance of "multiple stable fixed points" in the distributional 
recursion (6) as soon as q > 2. 

In this paper we establish the existence of the free energy density (3), and provide an 
explicit expression for its value, on graphs converging to regular trees of even degree and for 
all q > 2. Let us mention that the statistical physics folklore prescribes that the distributional 
fixed point with the highest Bethe free energy density should be selected. However, in the 
physics literature this is justified only via analogy with other models, without providing 

1 Existence of (3) for general 2k is equivalent to right convergence of G n in the language of [8]. 
2 Equivalently, there is only one Gibbs measure v on that satisfies the following properties: (i) v is 
invariant under automorphisms of TLj; (ii) v is a Markov chain on T^; and (iii) v(a = +1) > 0. 
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arguments which apply to locally tree-like graphs. Our result is the first rigorous verification 
of this variational principle in a non-trivial example for locally tree-like graphs. 

A different variational principle was proved in [18, 2] for mean-field spin glass models, 
but in that case the free energy density needs to be minimized. This difference is typically 
attributed by physicists to the difference between ferromagnetic and spin glass models; it 
remains an outstanding challenge to understand these two variational principles within a 
common framework. In the context of models on sparse graphs, Contucci et al. [12] recently 
proved that the variational principle of [18, 2] provides a bound on the free energy of anti- 
ferromagnetic Potts models, which was proved to be tight at high temperature. 

The rest of the paper is organized as follows: in the remainder of this introductory section 
we review the definitions of local convergence and the Bethe prediction and formally state 
our results, which we divide into two categories: in §2 we study the Bethe prediction on the 
uniformly random <i-regular graph ensemble. In §3-4 we prove results in the more general 
setting of graphs converging locally to the ci-regular tree. In each case we first consider 
general specifications ^ before specializing the the Potts specification (2). 

1.1. Local convergence. If G is any graph and U any subgraph, we write dll for the 
external boundary of U in G (the set of vertices in G adjacent to but not contained in U). 
For any vertex v of G, we let D v = \dv\ denote its degree, and write B t (v) for the subgraph 
induced by the vertices of G at graph distance at most t from v. Fix d throughout and let 
Td = (Trf, o) denote the <i-regular tree rooted at o, with = B t (p) the subtree of depth t. 

Definition 1.1. For G = ( V, E) finite undirected, let ( t {G) = |V|— G V : B t (v) ¥ 
where = denotes graph isomorphism. The sequence of (random) graphs G n = (V n = [n], E n ) 
is said to converge locally to the d-regular tree if for all t > 0, Ct(C?n) — > in probability 
as n — > oo. 

For G = (V, E) let Iq denote a vertex chosen uniformly at random from V, and write 
I n = I Gn . From now on let P n denote the joint law of (G n ,I n ), and E n the expectation 
with respect to P n . An equivalent definition of the local convergence of G n to is that 
linin-,00 F n (B t (I n ) — T*,) = 1 for all t > 0. 

Definition 1.2. The sequence G n is uniformly sparse if the random variables Dj n are uni- 
formly integrable, that is, if 

lim limsupEjD^lp^ > L}] = 0. 

We assume throughout that G n (n > 1) is a uniformly sparse graph sequence converging 
locally to the <i-regular tree T^. This setting is hereafter denoted G n -^-i oc T^. 

1.2. The Bethe prediction. 

1.2.1. Definition in d-regular setting. We now describe the Bethe free energy prediction in 
the special setting of d- regular trees; for a more general description see [14, 16]. We write a 
for elements of the finite alphabet ^ of spins, and a for vectors with entries in ; supposing 
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first that d is even, for a G let 

d d/2 
cr jf = l J=l 

Let A^-fc denote the — l)-dimensional simplex of probability measures on 5£ k . If In is 

a finite measure on 2£ k (any fc) and g is any function on then {g)h. denotes the integral 
of g with respect to h. For li 6 A^d let 

*-(ih) = <ir%, = <n") h , (4) 

With a slight abuse of notation we write h = h = (h 1 , . . . , h d ) with h j e A = £\% to indicate 
that lh is the product measure h(a) = Yl* =1 h d a .. The Bethe free energy functional is then 
defined for h G A by 

= $ vx (/i) - <t> c (h) = log ^ vx (/i, . . . , h) - log * c (/i, . . . , /i) 

= log { £ ^(a) ( J2 *K°, °')K) d \ ~ ~ log { E ^ 1 5 

this definition clearly extends to d odd. The Bethe prediction is that the asymptotic free 
energy (f> of (3) exists and equals 

$ = sup (5) 

where A* denotes the set of fixed points in A of the belief propagation or Bethe recursion 
BP : A A, defined by 

(BPfc)(a) s ^(^(^^(a,.')^", (6) 

cr' 

with z ft the normalizing constant. For permissive ^, any fixed point h G A* must belong to 
the interior of A (i.e. min CT h a > 0). An interior point h of A belongs to A* if and only if 

^(X>faOV) d ~ 1 = ** VaGiT. (7) 

CT cr' 

In this case, writing z ft = (V^fcoh = ^(^ cr^KK', 

$ vx (h) = \ogz h + logz h , $ c (/i) = ^logz,. (8) 

Fixed points h G A* correspond to "Bethe Gibbs measures," suitable candidates for the 
local weak limit of va n (see e.g. [16, Rmk. 1.12]). 
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1.2.2. Bethe variational principle. Let A e denote the (compact) set of symmetric probability 
measures h on ^T 2 , with one-point marginals denoted by h. Then A* embeds into A e via 
the relation 

K a < = (h gty /i)^, = . (9) 

Let £ = log-?/> and £ = logip. We then define 

= (o, - (d - 1)# (/o + ^[<e>h + H(h)} 

= -H{h\$)--H{h\h®^h). (10) 

In the above and hereafter, for p, q finite non-negative measures on fc , H(p) denotes the 
Shannon entropy — J2 x Px ^ogp x , and H(q\p) denotes the relative entropy ^2 x q x \og(q x /p x ) 
between q and p. We take the usual conventions logO = — oo, OlogO = and 01og(0/0) = 0. 
The Bethe prediction has the following variational characterization: 

Proposition 1.3. Let ^ = be a permissive specification. 

(a) Any interior stationary point h of $ corresponds to h G A* by the bijective relation (9). 
Any local maximizer h of $ is an interior point of A c; so 

$ = sup $(/i). (11) 

h£A c 

(b) An interior stationary point h of 3> is a local maximizer if and only if, for (X, Y) having 
(exchangeable) law h, 

Pxy = sup [ ^ X¥l 1 : if £ 0, = < ^r^T- 12 

t Varc^ X y J 2(d-l) 

Proof, (a) Follows from [16, Thm. 1.16] (using compactness of A c ). 

(b) Let A^ denote the set of functions d : i$T 2 — > K. satisfying 

<5<j CT ' = <5 CT ' CT , ^ = 0, and <5 CT(T ' 2 = 1. 
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It was shown in [16, Propn. 3.4] that an interior stationary point h of $ is a local maximizer 
if and only if 

4d 2 v $»(h + V S)\ v=0 = 2(d - l)((£/fi) a >S - d<(*/fc) a >h < V<5 e A±. (13) 
The condition (12) follows by taking cp = (h + S)/h and rearranging. □ 

Remark 1.4. The "symmetric correlation coefficient" pxy measures dependence of the 
exchangeable pair (X,F). 3 By the classical variance decomposition, p X y G [0, 1] with p^y = 
1 if and only if Y — f(X) for some deterministic function / (which by exchangeability must 
be involutive). If X and Y are independent, it is easily seen from Hoeffding's decomposition 
ip XY = ip XY + E{ip XY | X] + E{ip XY | y] - E[<Pxy] (with E[£ X y | X] = 0) that pxy = 1/2 



3 Note pxy is not the classical correlation coefficient between a(X),a(Y) (see e.g. [13] and references 
therein). 
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with supremum achieved by ip aa i of form (p a + . We do not know of an argument to show 
Pxy > 1/2 for any exchangeable (X,Y). 

1.3. Results for uniformly random rf-regular graphs. 

1.3.1. Expectation of the partition function. For dn even let M.d,n be the space of perfect 
matchings of [dn], and for m G M.d,n let G[m] be the (multi-)graph (i.e. with multi-edges and 
self-loops permitted) on vertex set [n] defined by m through the projection [dn] — > [n] taking 
i' G [dn] to its representative modulo n in [n]. The configuration model is the probability 
measure P^™ on G£% = {G[m] : m G M.d,n\ induced by the uniform measure on matchings 
M.d,n- The measure P^™ conditioned on the set Gd,n of simple graphs is simply the uniform 
measure F d>n on the ci-regular graphs on [n]. We write 0™ = n^E^flog Z n ] where E^™ 
denotes expectation under P^™. 

Theorem 1. For any permissive specification ^ = (ip,t/j), 

cm ee limsup^ < lim -logE^ m n [Z n ] = $ (14) 

n-s>oo n ^°° n 

If G_ is any (multi-)graph on [n] with maximum degree at most d and G is formed by 
adjoining a new vertex % to d or fewer vertices in GL, then 
v 

= J] V^i) 5^ TT (fffli) < VVax = HiaX [^(d) V ^(<T, 0"')], 

p- > ${p>) V TT VK, ^> G _ (a*) > Vffi, ^ min s min A ^{a, a*)], (15) 



so | log — log Zq_ I is uniformly bounded by a constant depending only on d, q, ^. Conse- 
quently, if P n is any probability measure on (multi-)graphs G n = (V n = [n],E n ) with max- 
imum degree at most d, then the Azuma-Hoeffding bound applied to the vertex-revealing 
martingale gives P„[|n _1 \ogZ n — (f> n \ > e] < e~ ce n for some constant c = c(d, q, xjj) > 0. On 
the other hand, even if asymptotically 0™ < $, there is a substantial subclass of graphs 
in G^ with free energy close to $: since n~ x \ogZ n is uniformly bounded over such graphs 
by a constant C = C(d, q,^) < oo, it follows from (14) that p n = P^™ [n -1 log Z n > $ — e] 
satisfies 

$ < lim inf - log (p n e Cri + [1 - p„]e ( *- e)n ) , 
thus p n > e ~[ c '- <I, + e / 2 ] n for large n. Since 

l&,n| x 1^1 x (nd - 1)!! ~ ^(nd/e)^ 2 (16) 

(see e.g. [19, Ch. 9]), this means that the set of graphs in Gd,n with free energy at least $ — e 
also grows like e w 2 ) nlo sn(i-o(i)) _ j n f ac ^ we have the following 

Corollary 1.5. If (jf m < $, then there exist x, C x > with 

P^K 1 log Z n > $ + x] > e^, (17) 
and a sequence of graphs G n G Gdn> G n —*ioc with liminf^oo n~ l \ogZ n > <3> + x. 
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Although the proof of the corollary is straightforward, we highlight it here because it 
demonstrates that if the uniformly random ensemble has free energy <ft n strictly below the 
replica symmetric solution $ - - as is expected to happen in replica symmetry breaking 
regimes — then there is a breaking of homogeneity in the graph space Q c ^ n as well, with a 
large subclass of graphs achieving free energy strictly above <3>. An interesting open question 
is whether the maximal asymptotic free energy is achieved by random bipartite graphs, as 
is known to be the case in two-spin models [27]. 

1.3.2. The Potts Bethe prediction. Surprisingly, another consequence of Thm. 1 is the fol- 
lowing solution to the optimization problems (5) and (10) for the ferromagnetic Potts model. 
Let h { denote the limit of successive iterations of BP starting from the uniform probability 
measure on [q], and let h m denote the limit of successive iterations of BP starting from the 
probability measure on [q] supported on spin 1. 

Theorem 2. For the Potts model (2) with (3, B > 0, $ = $(/i f ) V $(/i m ), and if B > then 
this is strictly greater than &(h) for any h G A*\{h f , h m }. 

We supplement Thm. 2 by a classification of stationary points of <fr (equivalently, via (9), 
solutions of the Potts Bethe recursion) as well as a study of which stationary points can be 
local maximizers. The motivation for considering local maximizers of $ — which after all are 
irrelevant to the Bethe prediction (11) if they are not global maximizers — is that we expect 
these are precisely the fixed points which can be seen in local weak limits of conditioned 
factor models on graph sequences G n -^i oc Tj, in the spirit of [26]. That is, when h is a 
local maximizer of the factor model restricted to configurations of edge empirical measure 
close to h should converge locally weakly to the (Bethe) Gibbs measure corresponding to h. 

Detailed statements are given in Propns. 2.3 and 2.4. We show in particular that any local 
maximizer h of <fr must correspond (via (9)) to h with \{h a : a G <%~}\ < 3. On the other hand 
we show that for B > small and (3 > large there exist solutions h = (Q, p + , p_, . . . , p_) 
corresponding (via (9)) to local maximizers of 

1.4. Results for general d-regular graphs. For d even, we establish the Potts Bethe 
prediction for general graph sequences G n -^i oc T^: 

Theorem 3. For the Potts model on G n — >i oc TLj with d even, 0(/3, B) = $(/3, B) for all 
0,B>O. 

This theorem will be deduced from the following result for abstract factor models which 
illustrates a more general principle. We restrict hereafter to d-regular graph sequences 
G n —>ioc Trf, since in §3 we will show that, for the purposes of computing the free energy, 
general sequences G n — >i oc T d can be reduced to the d- regular case using the uniform sparsity 
hypothesis. 

For fixed G = (V, E) we let Eg denote expectation over the uniformly random vertex 
Ig G V. We define symmetrized versions of \l/ e , \l/ by 

# e ' sym (]M = — V ^W), * sym = — — 

v ' q]\ Z-^ v n \]/e,sym 
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where Sd denotes the symmetric group on d letters and h. n (a) = h.(a n n), . . . , <J n (d))- Given a 
measure p on A d define the mixture of product measures 

p G A xd , p(a) = J h\ ■ ■ ■ h d ad dp(h), (18) 

Throughout we write o(t, x) for a uniformly bounded function such that 

limo(t, x) = o(t), lim o(t) = 0. 

The function may change from line to line with the understanding that it can be chosen to 
depend only on d, q,2k- 

Theorem 4. Suppose ^ is a permissive specification and G n — >i oc TLj, d even. Suppose that 
for all finite d-regular graphs G, all v G G, and all t G Z> , there is a measure p = p(G, v, t) 
on A d such that 

(i) Fori = Ig andi>- = ^G\/(£a/ = •); ^ e measure p satisfies Eg[||i/_ — p|| T v] = °(t,(t(G)). 

(ii) For all v, t we have log \l/ sym (/i) < $ + o(t) for all h G supp p(G, v, t). 
Then lim sup n <p n < $ for the factor model on G n specified by 

In §4 we show that the conditions of the preceding theorem are satisfied in the Potts 
model with (3,B > and hence limsup^^ 4> n (j3, B) < $(/3,S); Thm. 3 is then proved as 
the matching lower bound liminf^oo <p n (/3, B) > B) was shown in [16, Thm. 1.10]. 
To give some motivation for the conditions of Thm. 4, consider the following randomized 
operation R on finite d- regular graphs G = (V, E): 
Operation R: 

Let (t>i, . . . ,Vd) be an enumeration of the neighbors of a uniformly random 
vertex I = I G in G. For each 7r G Sd let G n be the graph formed by adding 
to G_ = G\I the edges (v^Mj-i), v Tv(2j)) for 1 < j < d/2. Then set RG = G n 
for 7T G arg min w / [log Z G — log Z Qv i\. 

Thm. 4 is then proved (in §3) by expressing E n [log Z n ] as a telescoping sum over E n [log Z nJG — 
log 

Z~r?+ 1 g\i an d showing that conditions (i) and (ii) above imply that each term in the sum 
is bounded above by <J>: indeed, with z/_ = vg_{o_q I — •) G A X d as before, notice that 

Z<; (^ vx Gl)>,- = ^ vx (^-), |^ = W£)>^ = ^°(^)- (19) 



Condition (i) gives that these are well approximated by 

W x (p)= [ ^ vx (h)dp(h), ^ c (p n )= I ^ c (h n )dp(h) (20) 



The Bethe ansatz is that logty(h), for h varying in (a possibly restricted subset of) A d , 
is maximized at the replica symmetric solution W = h, with value log\I/(/i, . . . , h) = $(/i). 
Condition (ii) says that this holds for all h G supp p in an averaged sense. Of several natural 
modifications of R which we considered for the case of d odd, all fail condition (ii). 
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2. Uniformly random ^-regular graphs 

2.1. Expectation of the partition function. For a a spin configuration on G = (V,E), 
the edge empirical measure L e n G A c of a is defined by 

KM = ^- im^^i) = M}+ Hiaj,^) = (a, a')}], 

1 1 (ij)eE 

and we write for its one-point marginal. Let Af ,n denote the set of edge empirical 
measures associated to spin configurations on graphs in Qf^ (in the notation of §1.3.1). 

Lemma 2.1. For h e Af n , 

\&\ n F%[L* n = h} = exp{n[(d/2)H(h) - (d - l)H(h)] + err} 
where |err| < clogn for c a finite constant depending only on d, q. 
Proof. Writing = [q], we compute 

\^rF™[L c n = h] = £(h)M(h) (21) 

where 



nh\ , . . . , nh q J \ndh al , . . . , ndh 



is the number of ways to assign spin values to the nd half-edges subject to pair empirical 
measure h, and 

Wl(h) = [(nd - l)!!]- 1 \[(ndh aa - 1)!! V( ndh ™>) 1 

is the number of perfect matchings m G M.d,n on half-edges respecting the spin assignment 
divided by the total number |A^d, n | of matchings of [nd]. With the convention 0° = 1, 
Stirling's approximation (see e.g. [31]) 

T(z + 1) = e o ^ +1 ^[V2^} 1{z>0} (z/e) z 
gives C(h) = exp{n[dH(h) — (d — l)H(h)] + O(logn)}. Similarly, for n even, 

(„ _ i)!! = ^r(^) = e°«" +1 )- 1 )(n/e)-/ 2 y2 

\/7T V 2 / 



so 931(h) = exp{ — (nd/2)H(h) + O(logn)}, and the lemma follows. □ 

Proof of 'Thru. 1. The inequality follows trivially from Jensen's inequality. To compute 
E^™[Z n ], let a be drawn from the uniform distribution on and independently let 

G n ~ ^dn- Writing Lr n for the edge empirical measure of a regarded as a spin configu- 
ration on G n , the expected partition function for the model specified by %ji on G n can be 
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expressed as 

E&[2»]= E m n V7nlK = h]ex P {n(0- h + (nd/2)(0 h } 

= exp{n$(h) + 0(logn)}, (22) 

where the last line follows from (10) and Lem. 2.1. By Propn. 1.3 (a), <fr attains its global 
maximum at an interior point h* G A c , which by (21) lies within distance 0(l/n) of 
Ag' n . Moreover the cardinality of Af ,n is trivially bounded above by (nd) q , so we find 
logE^[Z n ] = n&(h*) + O(logn), implying the theorem. □ 

Turning to the proof of Cor. 1.5, we first note that in the uniformly random d- regular graph 
ensemble, for any fixed e > and t > we have (t{G) < e with overwhelming probability as 
n — > oo: 

Lemma 2.2. For any e > 0, t > there exists a constant a = a(d,e,t) such that both 
^TnlUG) > e], Pd,n[Ct(G) > e] are bounded above by e - anl ° sn . 

Proof. Consider the process of revealing the graph G ~ P^™ edge by edge. At the fc-th step 
define Ik to be the indicator that the edge forms a cycle of length < It within the graph 
revealed so far: if Ct(G) > e then we must have ^2 k h > cto^ for some cto = ao(d,e,t) > 0. 
For k < n(d — ao)/2, the conditional probability of If. — 1 is at most OL\jn for o.\ = 
cti(d, e, t, q:q) > 0. By a classical martingale inequality (see e.g. [21, Thm. 6.1]), 

VZMG) >e]< [j> > H < P [ Yl t*^ T 

fc fc<n(d-Q )/2 

/ ai/n \™<W 2 / 1 Nn(d/2-o ) 
< [ _LZ \ ( ) < e ~a 2 nlogn 

~ \a /(d — a )J \1 — a /(d — a )J ~ 

for c*2 = a2(d,e,t,ao,azx) > 0, which proves the result for P^™. The result for P^ n follows 
immediately by (16). □ 

Proof of Cor. 1.5. For let p^(y) = P^[n _1 logZ n >$ + ?/]. Take 5 > such that 

lim inf n^oo [$ — </>^] > 25 > 0, and recall from §1.3.1 that the Azuma-Hoeffding bound 
implies p^™(— 5) < e~ c52n . For any e > 0, it holds for sufficiently large n that 

< e (*~*)« + p cm (x)e Cn + (p » (4) _ p BB (x))e (»+.)» x > __ & 

Taking < e < M(c5 2 /2) and < x < (c5 2 /2) A (C — $) gives > e -"[ -*+ 2e ] = e" c * n 

which proves (17). By Lem. 2.2 we can let e 1 and t f oo slowly enough in n that 

hm — = 0, 

implying the existence of G n G Qf^ L , G n —>i oc T d with liminf^oo n^ 1 \ogZ n > $ + x. □ 
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2.2. The Potts Bethe functional. In the remainder of §2 we study global and local max- 
ima of the Bethe functional <fr for the ferromagetic Potts specification. 

The Potts Bethe recursion for 0,B > preserves the subspace A of measures 

he A, h = (h u ...,h q ) = ^(l + (q-l)b,l-b,...,l-b), (23) 

parametrized by < b < 1. The map BP restricted to this subset is simply a univariate re- 
cursion b i — y BP(6): in terms of the log-likelihood ratio r = log(/zi//i2), it has the particularly 
simple form 

BP : r ^ B + (d- 1) log ^ , (24) 

analyzed e.g. in [16, Lem. 4.6]. The maximal and minimal fixed points (in b) are given by 

6 f = lim BP w (0), b m ee lim BP W (1). 

t— >00 t— >OD 

By monotonicity, 6 f and b m are well-defined with b { < b m , and they are the only fixed points 
within this set. 

In §2.2.1 we use the result of Thm. 1 to prove Thm. 2 that $ attains its global maximum 
on one of the edge empirical measures corresponding (via (9)) to b f ,b m , and moreover that 
when B > these are the only possible global maximizers. This gives an essentially explicit 
solution to the Bethe variational problem for the ferromagnetic Potts model, and we do not 
know of a proof which does not go through the probabilistic results of Thm. 1. 

In §2.2.2 we supplement Thm. 2 with a study of the local maximizers of In view of the 
calculation (22), we expect local maximizers of h of $ to have the following probabilistic 
interpretation, which is in the spirit of results of [26]: if G n — >i oc T^, the factor model on 
G n conditioned to the subspace of configurations with edge empirical measure L e n close to 
h should converge locally weakly to the (Bethe) Gibbs measure corresponding to h. With 
this motivation in mind we classify the stationary points of $ and study which ones can be 
local maximizers. 

2.2.1. Global maximum. We first review the well-known random-cluster (fk) representation 
of the Potts model. The Edwards-Sokal (es) measure on a finite graph G = (V, E) is the 
probability measure on pairs (<r, rj), where q_ e ^ v is a spin configuration as before and 
{0,1}^ is a bond configuration, given by 

w G (a )ZZ )oc l[e B1 ^=V J] [(l-p) 1 -*+p'*l{<Ti = <T j }], p = l-e-P. 

i£V e=(ij)£E 

The marginal on a is the Potts model with parameters (/3,B), while the marginal on n is 
the FK measure 

^ocn^a-p) 1 ^ n (i+(<?-i)e- Bic| ), 

eeE ce*"(a) 

where the second product is taken over the collection ^(rj) of connected components C of n 
(with \C\ the number of vertices in C). Conditioned on an FK configuration n with connected 
components C\, . . . ,Ck (with k = k{n) < n), a realization of a from wq{- \ rj) is obtained 



REPLICA SYMMETRY FOR POTTS MODELS ON d-REGULAR GRAPHS 13 

by giving the same spin <j£ to all the vertices of each component C#, independently over the 
different components, such that 

Mat = <y\R)= u E e B \ c ^= l \ u e ee eB \ CA l +q _ v 

Proof of Thm. 2. We assume without loss that B > 0, with the result for B = following by 
continuity. Take G n any graph on [n] and n any bond configuration on G n , with connected 
components C\, . . . , Ck- For a ^ 1 let = Yf = |C^|[l{<7^ = a} — m]. It is easily verified 
that the cumulant generating functions Kt(t) = logE[e' yf ] = log(l + ug(e^ Ce ^ — 1)) — tue\Cg\ 
satisfy sup 4<B / 2 n"(t) = k'[{B/2) < c for some finite constant c = c(B,q) not depending on 
\Ce\, and so 

k 

w Gn ( Y ? ^ en I a) < e- nlet ~ ct2/2] < e" ne2/(2c) provided e < cB/2. 
i=i 

Thus it holds with vjg 71 {' I ZZ) _ P r °bability at least 1— e _v/ ™ that L™{q) belongs to the subspace 
A n of measures of A within distance n^ 1 ^ 8 of the space A defined above. Consequently, 
if Z^ al denotes the Potts partition function of G n restricted to {a : L™(o_) G A n }, then 
Z^ al /Z n > 1 — e _v/ ™ since this ratio is simply the average of zuG n (L™(a) G A n | rf) (as a 
function of rf) with respect to n Gn . 

Now recall the calculation (22) for the random regular graphs. For h G A c with h ^ A, 
for sufficiently large n we have h ^ A n , so the contribution to E^™ [Z n ] from configurations 
a with £^(cx) = h is 

exp{n$(h) + O(logn)} < e~^E^[Z n ] = exp{n$ - O(logn)}, 

so we see that any global maximizer h for <fr must lie in A. Let h G A* correspond to h via 
(9): summing (9) over a' G [q] gives 

Z/X = ^[(^-1)^ + 1], 

which implies (since the right-hand side is increasing in h c for h a > 0) that h is symmetric 
among the spins 7^ 1 and has a non- negative bias towards spin 1. It is easily checked that 
the only such h are h { and h m (see e.g. [16, Lem. 4.6]) which concludes the proof. □ 

2.2.2. Local maxima. For the g-Potts model with B > and (3 > we reparametrize m = 
e B >1,9 = l/(e /3 - 1) > 0, so that (7) simplifies to 

mF(/i x ) = F(/i 2 ) = . . . = F(/i,) = z h , F(x) = x~\x/6 + l)*" 1 . 

For v = 9/(d — 2), F = F\(o, v ] is monotone decreasing while F + = F| is monotone 
increasing, so clearly |{/z.2, • • • , h q }\ < 2. More precisely, we have the following classification: 

Proposition 2.3. For the q-Potts model with parameters m = e B > 1 and 6 = l/(e /3 — 1) > 
; for any h G A* there exists 1 < i < q and it E S q such that 

Q± = f± l (z h /m), a = 1, 
^ (CT) = <( p + = F; 1 ^), 2<<t<£, (25) 
_ ee Fl 1 ^), £ + l<a<q, 
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and 7r(l) = 1 ifm>l. 

(a) Ifv>l or mf(v) > F(l) then £ = 1 for all h G A*. 

(7>,) We say that h G A* is an £±-type solution if (25) holds with h n m = Q±. For I > 2, if 
qv < 1 and 1 < m < mt{9) then A* has £±-type elements. 

Proof. It is clear from the preceding discussion that every h G A* is of the form described 
in (25). 4 Fixed points of (24) correspond to 1±- or g±-type solutions. 

(a) If v > 1 then F = F + is injective on (0, 1] so necessarily i — 1. If h G A* has £ > 1 
then p + < l/(£— 1), so for Q ± = F^ 1 [F(p + )/m] to be well-defined we must have mF(v) < 
F[l/(£ — 1)]. In particular, if mF(v) > F(l) then again all solutions must have £ = 1. 

(b) Assuming v < 1, the function 



g e ± m (p) = Q±(p) + (£- l)p + (q~ £)P4P) 

= F ± l [F{p)/m} + (£- l)p + (q- ^)Fl 1 [F(p)] 

is well-defined for p G [po, 1] where po = po(m) = F^ 1 [mF(t>)]. Note that ^± m (l) > £ — 1, and 
lim m ^ 1 p ( m ) — w which implies lim m ^! ^± m (po) = <?t>. If £ > 2 and gt> < 1 then continuity 
of g± m (p) implies that for m > 1 sufficiently small we will have g^ m (p + ) = 1 for some 
p + G [po, 1], giving an £±-type solution as claimed. □ 

We next study which of the stationary points classified in Propn. 2.3 correspond to local 
maximizers for 

Proposition 2.4. In the setting of Propn. 2.3, 

(a) Solutions of type £± with £ > 2 are never local maximizers. 

(b) Form > 1,8 > both sufficiently small, there exist both l + -type and 2^-type solutions 
which are strict local maximizers with (strictly) negative-definite Hessians. 5 

Proof, (a) Let h G A c be the stationary point of $ corresponding to h via (9). We will 
apply the correlation criterion (12) with ip aa < = (p a + (p a i. Let h correspond to h G A* via 
(9), so that 

= /W _ h a ,{8 + l{a = a'}) 

a ' w ~ K ~ o + K 

If we assume (<p)h = 0, then 

8 h 

E[ip Y | X = a] = (<p) h + \ (p a = 7 a ^ ff 

8 + h a 8 + h a 



The terminology degenerates in some cases, in particular when m — 1: in this case Q € {p-j_} so the 
£+-type solutions coincide with the (£ + l)_-type solutions for 1 < £ < q, and the only l_-type or q + -type 
solution is the uniform distribution on [q\. 

5 If qv < 1 then there can be no l_-type solutions. 
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for 7 CT = h a J {6 + h a ). Thus K[cp X Y \ X] — (1 + 7x)y?x? and (12) becomes 
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2(E<^) 2 >E (l + 7 xV 
= E 



(d - l) 7 x - 1 



ft J ^ 2 h X~V 

(1 + 7x)^ x 



+ 



d- 2 
1 

# + \\h\\ 2 



(26) 



(using h a = h a (6 + /i CT ) /(9 + \\h\\ 2 ) for the last identity). If h is an £-type solution with £ > 2 
then yjg- = 1{<7 = 7r(2)} — l{a = vr(3)} (for 7r as in (25)) clearly violates (26), so h cannot 
be a local maximizer of 

(b) Let m = 1 and # sufficiently small so that a l + -type (and 2_-type) solution h = 
(Q+, p_, . . . , p_) G A* exists, given by taking the log-likelihood ratio r = log(Q + /p_) to 
be the maximal fixed point of the mapping BP of (24). For d > 3 and < e < 1, 

BP[(d - 1 - e)0] > (d - 1)(B - logg) > (d - 1 - e)0 V/3 > (rf ~ 1)logg , 

e 

so crudely we have r > (3/2)/3 for all /3 > 2(d — 1) logg. Let h G A e be the stationary point 
corresponding to this fixed point: recalling (13), for 8 G A^ we calculate 



E^ ^ 



«/. (( ^ //l)2);i Q + (e^Q + + (g - l)p_) "~ p_(Q + + (e/ 3 + g - 2)p_ 
1 



< 



5? , E^i^ 



Q+p 



-((<w 2 >/*>7^ + 



5n 2 , 2 E CT =^1 ^1<7 2 , E ct ,<j'^1 



+ 



z h - - Q 2 + e? ' Q+p. 
Since p < e~( 3 / 2 ^Q + for sufficiently large /3. 
eV 



eV- 



lim ^<( W) 2 , = 0, liminf ^{{8/h) 2 ) h > £ 8^ 



so for any fixed e > we have d 2 <&^{h + /7*5) |r?=o < uniformly over all 5 G A^ with 
(<7 ~~ I) 2 EuffMi ^o-o-' 2 > e 2 once /3 is sufficiently large (depending on e). 

Suppose instead (g — l) 2 Eo-o-'^i ^o-o-' 2 < e 2 : by Cauchy-Schwarz Eo-o-yi \^< T(T '\ — e ' 

Q+P - [2(d - 1) {{8/hf)h - d((5/h) 2 )^] < 2(d - 1) % ~ 2d E ^ 



so 



lim sup 

/3->oo -2ft 



< 2(d - 1) eo^-i + £ ] 2 - 2rf £ ^ 2 - ~ 2 E 5i - 2 + 2 ( rf - x ) [ 2e E i^-i + - X < 

On the other hand, 6 G A^ implies 

-2 n 1/2 



2 ^<5iJ = |<5n + e| > |*n|-e> 1-2^<5 1(T 2 - 



( q -m 



— e 



so by choosing e > sufficiently small we can guarantee that for /3 large enough, d 2 Q^(h + 
?7<5 ) | ^=0 < uniformly over all 8 G A^f , implying that h is a strict local maximizer of <fr with 
strictly negative-definite Hessian. 
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This concludes the proof for m — 1, and the conclusion for m > 1 sufficiently small 
follows by a perturbative argument: arguing similarly as in the proof of Propn. 2.3 (b), for 
1 < m < mo the equations 

g 1+ (p) = F- 1 [F(p)/m] + (q-l)p=l, 

g 2 -(p) = Fl^Cp)/™] + F; 1 [F(p)] + (q-2)p=l 

have solutions pl + (m), p__~(m), corresponding to l + -type and 2_-type solutions respectively, 
which are continuous in m with initial values p 1+ (l) = p 2_ (l) = P_ corresponding to the 
solution considered above at m — 1. For sufficiently small m, it follows by continuity that 
the Hessians at the stationary points h 1+ (m), h 2 ~{m) corresponding to p_: + (m), p?_~(m) will 
be strictly negative-definite, implying strict local maximizers as claimed. □ 

Remark 2.5. Related to the study of local maxima is the question of the local stability 
of the Bethe recursion. For the Potts specification (2), the linear (differential) mapping 
Dh = DBP(h) defined on the space {5 : ~Y^ a 5 a = 0} by 

z^ii m BP ( ft + "*>- BP W 

can be explicitly diagonalized when h G A* and shown to have all eigenvalues positive, with 
maximal eigenvalue greater than 1 at £±-type solutions with i > 2 and at 2 + -type solutions. 
At a 2_-type solution (assuming m > 1, so it is not also a l + -type solution) the maximal 
eigenvalue is less than 1 if and only if 

p'i >44 + ^ +( ,_ 2) ^. (27) 



p + — v d — 1 v — Q_ v — p 

However, if h is not the uniform measure on [q] then is not symmetric and so does not have 
orthonormal eigenbasis, so having all eigenvalues less than 1 need not imply contractivity of 
Dh- It is not clear how to relate (27) to the local stability of the non-linear map BP. 

3. Recursive graph decomposition 

In this section we prove Thm. 4. Recall from §1.4 the notation o(t, x); we also let c denote 
a finite positive constant which is permitted to change from line to line but depends only 
on d, q,2t- The following lemma, whose proof we defer to the end of the section, reduces the 
free energy computation on general G n -^i oc T d to the case of d- regular graphs. 

Lemma 3.1. If G n -^i oc with d even, then there exists a d-regular (multi-) graph sequence 
G' n -> /oc T d with free energy <p' n such that lim n _ >oo (0 n -<j/ n ) = 0. 

We prove Thm. 4 via the following propositions about the operation R on <i-regular graphs. 

Proposition 3.2. Let G = (V, E) be any finite d-regular graph, and recall denotes 
expectation over the uniformly random vertex 1q. Under the conditions of Thm. 4, 

E G [logZ G -logZ RG ] <$ + o(*,C t (G)). 
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Proposition 3.3. Suppose G n -^i oc T^. Then for all €q > and all t > 0, 

= 0. 
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lim E„ 



max C t CRPG ri 

0<j<(l-e )n 



We first assume the preceding results and derive Thm. 4: 

Proof of Thm. 4- Take e > fixed, let n = [(1 — e )n\, and let e = e n be defined by 
uq = (1 — e n )n. Express the free energy of the factor model on G n as the telescoping sum 

- E ^ogZ WGn - logZ R j+l G J + -E n [logZ R n 0Gn ]. 



?? 



J'=0 



By definition of R, R n °G n is a <i-regular graph on en vertices, son 1 \ log Z-Rn oGn \ < ce. Next, 
Propn. 3.2 gives 

max E n [logZ RJGn -logZ RJ+ i G J - $ < o(t, max E n [C t (R*G n )]) 

0<j<n 0<£<n 

(where we may freely move the expectation inside o(t, •) by uniform boundedness of o). In 
the limit n — > oo the right-hand side above tends to o(t) by Propn. 3.3, so the telescoping 
sum yields 

lim sup n < (1 - e )[$ + o(t)] + ce . 



The result follows by taking first t — >• oo and then e 4 0. 



□ 



The remainder of this section is devoted to proving Propns. 3.2 and 3.3 and Lem. 3.1. 
Recall (15) that Vmtn < Z G /Z G _ < similarly 



d/2 >^l> V , ((T P )(T P)^_ (a 



Z G 

for 3g = Vw/ ip : 



1)1 



a 



p ) > 



d/2 



> 



d/2 



\ar\i&\dl\&\WpT)\ - \^\d^d{d+i) 



(2? 



Proof of Propn. 3.2. Fix t > 0, and let p = p(G,I,t) as in the statement of Thm. 4. It 
follows from condition (i) — recalling (19) and (20) and making use of the boundedness of 
t£ — that 



J G- 



+ 



J G n 



J G- 



— ^ e (p" 



< o(t, Ct(G)) with probability > 1 - o(i, Ct(G))- 



Then, by the choice of permutation in the definition of R and using the bounds (15), (28) 
on the ratios Z G /Z G _ and Z G k/Z g _, we find 



E 



G 



log- 



J G 



J KG 



E, 



G 



. Z G /Z G _ 
log mm ■ 



vr Z G tt/Z G _ 



< E, 



G 



log 



^ vx (p) 



o(f,C*(G)) 



< E r 



log max V sym {h) +o(t,tt(G)). 
hesupp p J 



Condition (ii) of Thm. 4 gives that the first term is < $ + o(t) which implies the result. □ 
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Proof of Propn. 3.3. Let G be a graph on n vertices. For fixed e\ > (to be determined), let 
e = e n be defined by e n n = [ein] , and consider en successive applications of R. Assume £ > 
is a (large) power of 2, and for v G G let r„ be the indicator of the event that v is one of the 
n[l-Ct(G)] vertices with B t (v) = T d , but B t/2 (v) ^ T* /2 in WG for some j < en. Then E[T V ] 
is bounded above by the probability that at least t/2 vertices are deleted along a length- 1 
geodesic path started from v. For a single path, this probability is W[H > t/2] where H is 
a hypergeometric random variable with parameters n, t, en. By a standard hypergeometric 
tail bound (see [11]), 

r / 1 1 1 n \ l—"ft/n-\ en 

AH > jet] < Y l hV tln { l _ lt/n ) < (l/7) 7ei e (7 " 1)et < exp{- 7 et[log 7 - 1]}. 

Taking 7 = 1/ (2e) and summing over d l geodesies gives 

E[r„] < exp { - i ( log 1 - 2 log d - l) } , 

which can clearly be made < e~*/ 4 by taking e x sufficiently small (depending only on d) and 
n, t large. Markov's inequality applied to the Y v (v G G) then gives 



P 



max (ra - j)( t /2(R j G) > n( t (G) + ne~ 



-t/8 ^ e -t/8 

.0<i<en 

Iterating for L = |~(logeo)/log(l — ei)] steps, and writing t(L) = t/2 L , gives that with 
probability at least 1 — Le~^ L ^ 8 , 

(n - j)( t/2 L(R j G) < nCt(G) + Lne"* (L)/8 for all < j < (1 - e )n. 

Therefore, for G = G n — >i oc Td, 

E n max Ci/2i(R- J G n ) < . 

o<j<(i-e )n e n 

In the limit n — > 00 the right-hand side tends to Le~^ L ^ 8 /eq which decreases to zero as 
t — > 00, but the left-hand side is non-decreasing in t so it must in fact tend to zero as n — > 00 
for all t, as stated. □ 

Proof of Lem. 3.1. Fix n for the moment and suppress it from the notation. Delete edges 
in G incident to vertices of degree larger than d until none remain, and denote the resulting 
graph G" = (V, E"). Let U denote the set of vertices incident to any edge in E\E"; arguing 
as for the bounds (15) and (28) then gives 

7 1 \E\E"\ I \E\E"\ 

J,\E\E"\ > > ,\E\E"\ , = px > ^min > ^min 

V^max - Zqh - ^min ^G"\2jj - O ) _ \gr\\u\g\ U \+\8U\ ~ | % | \U\^(d+l)\U\ 

(where the last inequality uses that G" has maximum degree at most d). Then note that 
\U\/2 < \E\E"\ < YtvecKDv ~ d) V 0} < \V\ E G [D Ia l{D Ia > d}), so | logZ G - logZ G »| < 
c\V\E g [D Ig 1{D Ig > d}]. 

Now let W denote the set of vertices v G V whose degree D" in G" is less than d, and 
add d — D" new half-edges leaving from each such v. The number of unmatched half-edges 
resulting from this operation has the same parity as ^2 veG (D v — d) = 2\E\ — d\V\, so it is 
even. Taking a random matching of these half-edges results in a <i-regular (multi-)graph G' . 
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The total number Se of edge insertion or deletion operations to go from G" to G' is at most 
a constant times | V|Eg[(£)/ g V 1)1{D Ig ^ d}], since if v G W then either v G U or D v < d in 
G. Each such operation changes the log-partition function by at most an additive constant, 
since all graphs involved have maximum degree at most d. Therefore 

\logZ GI ,-logZ G ,\ <c\V\E G [{D lG Vl)l{D lG ^d}\. 
Combining with the previous bound on | log Zq — log Zq" | gives 

\(j> n -4>' n \<c^ n [{D In yl)l{D Inl Ld}} 

< cLF n [D In ^d] + cE n [D In l{D In > L}\. 
The lemma follows by taking first n — > oo and then L — > oo in the bound above. □ 

4. The Potts free energy density 

In this section we prove our main result Thm. 3 giving the free energy density of the q- 
Potts model on graphs converging locally to the d- regular tree with d even. Let A [e] denote 
the measures h G A of form (23) with b G [b f — e, 6 m + e] . 

Proposition 4.1. For the Potts model with (3,B > 0, logsup ftg (A[o]) d ^ sym (^) = ( &- 

Proposition 4.2. For the Potts model with (3 > and B > 0, for all finite d-regular graphs 
G, all v G G, and all t G Z> 0; there is a measure p = p(G,v,t) on (A[o(t)]) d such that for 
I = Ig and V- = vc\i((Ldi — the measure p of (18) satisfies E G [||z/_ — p|| T v] < °(t,Ct(G))- 

Proof of Thm. 3. The result follows from Thm. 4 since condition (i) is verified by Propn. 4.2 
while condition (ii) is verified by Propn. 4.1 (using continuity of \|> sym on A d ). □ 

Proof of Propn. 4-1- Step 1. For h G (A[0]) d we abuse notation and write ^(h) = ^{b) with 
h j (l) = (1 + (g - l)V)/q. Then 

d d 
C- d ^(b) = e B + r,V) + (q - 1) " 7^/(5 " 

d/2 

c- d/2 ^ c (b) = Y[(i + 1 b 2] - i b 2j ) 

where C ee (e^ + q- l)/q, j ee (q- l)(e /3 - l)/(e /3 +q- 1) > 0. Both ^ vx and ^ e - sym are affine 
in each V (keeping (b k )k^j fixed), and so \|> sym is maximized with V G {b f ,b m }. Therefore 

sup ^ m {h) = sup ^ sym (6). 

he(A[o]) d be{b f ,fe m } d 

Step 2. Let g s (b) = log ^(6, 6 s , ... , b s ) — (d/2) logC for s G {f, m}; we claim that g s is constant 
in b. To see this, note that fy vx (h) = Zs{h), the partition function of the Potts model on the 
star graph S = with boundary conditions <jj ~ h? independently for the vertices j G do. 
Similarly, ^ c (h) = Zji(h), the partition function of the Potts model on the graph R of d/2 
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disjoint edges (2k — 1, 2k), again with boundary conditions <jj ~ h? independently for all j. 
Now if 5" is S with the edge (o, 1) disconnected, then, using the BP relations for b s , 

Z s (b,b s ,...,b s ) 



Z s >{b, b s ,...,b s ) 



-fbb s . 



But Z s > does not depend on b, so with 6 s = (b s , ■ ■ ■ ,b s ) we have 

Z s (b,b s ,...,b s ) 1 + 



Zs(W) "1 + 7 (6 S ) 2 - (29) 

If R' denotes R with the edge (1,2) disconnected then by the same argument (29) holds with 
S, S' replaced by R, R', and so 

Step 3. Let b G {b f ,b m } d , and let £ denote the number of indices j for which V = b m . Then 

cr^o = + ^(i±f )' + (, - i)<i - *7< 9 - i)y( \-_^:^ ' 

= A e a ° e + A ie ~ aii = exp{/ vx (£)}, A,, aj > 0. 
By Jensen's inequality, 

iog^ e ' sym © > \. lo s^ c (^) = f c ( £ ) + w 2 ) l °£ c - 

Note that for £ G {0, 1, d- 1, d}, both sides are equal to log^ e (6). For < i < d, 
f e (£) = [2(d - l)]" 1 W - l)C mm + (d-£)(d-£- l)C ff + 2£(d - £)C mi ]} 
= a 4 £ 2 + a^£ + a 2 , 

where C ss / = log(l + 7& S 6 S ') for s, s' G {f, m}, and 2(d — l)a 4 = C mm + Cff — 2C m f which for 
6 f < b m is strictly positive by the arithmetic-geometric mean inequality. If we now consider 
/ = f vx — f c as a function of £ G E, then 

v4 a e ao£ - Aiaie^ aii 
tends to =)=oo as £ — >• ±oo. Moreover 

which can have at most one real zero. Thus /' has at most one inflection point, hence at 
most three real zeroes; further, if there are three zeroes then the middle one corresponds to 
a local minimum of /. But f(£) = g s (£) for £ G {0, l,d— 1, d}, so Step 2 implies /(0) = /(l) 
and /(d — 1) = /(d), and consequently /' has zeroes in (0, 1) and (d — l,d). Therefore / 
cannot have a local maximum in [1, d — 1], so it is maximized over {0, . . . , d} with £ G {0, d}, 
completing the proof. □ 



/ (*•) = — a e a i 2a 4 « — a 3 
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To decompose the measures v_ as mixtures over (A[o(i)]) d , we use the random-cluster 
(fk) representation reviewed in §2. 

Proof of Propn. 4-2. For < s < t < oo and v G G let A S:t (v) = B t (v)\B s (v). It holds with 
probability at least 1 — o(t, (t(G)) that for I = I G with dl = (vi, . . . , v^), the balls B t {yi) 
defined with respect to G_ = G\I are pairwise disjoint and isomorphic to the first t levels 
of the (d — l)-ary tree. On this event, V- = uc_(g^ 9I = ■) has the decomposition 

d 

"-(0 = Yl w -(-\UA s , t )^-(UA s , t ), A s,t = \jA Stt {Vi) 

(where = zu G _(a SI = •) and 7r_ = 7T G _). We claim that with s = \ogt the measures 
E7_(- \ua t ) ^ ^\% d are approximately in (A[o(t)]) d . Indeed, write B t for the disjoint union 
of the balls B t (vi), and decompose 

|2Za s ,J = IsbJMzzb, |zz As , t ) (so) 

For | G {0, 1} write w_ and 7TI for the ES and FK measures respectively on G_ conditioned 
on ZZg_\b ( = X (^ ne measures , 7T° restrict simply to the ES, FK on B t ). The FKG property 
of random-cluster measures (see e.g. [7, Thm. Ill.l(i)]) implies the stochastic domination 
relations ir°_ ( 2Bt = • | r^J MzZb ( = • I ZZ As ,J ^ ^(zZb* = * I Ha,,*)- Now note that 

^mMaJ . ,_ n c ^[i + (g-i)e" B|c| ] 

^(2ZbJ2a s ,J ^s^- l + (g-l)e-^l 

where =5^ is the set of connected components C of ri Bt joining B s (vi) to the boundary of 
B t (vi) for some i, and is the union of these components with the complement of B t . 
Each C G 5? has size at least t — s and ^ has cardinality at most d s , so p(n Bt ) — 1 
uniformly in 7£b as £ — > oo. 

Similarly, ro_(- IzZbJ * s we ^ approximated by w°_{- |zZb ) (within total variation distance 
o{t)) uniformly over all r[ Bt , since any cr v . for t>j not connected to dB t {vi) in n has the same 
distribution under the two measures independently of all the other o v . , while the o~ Vj for un- 
connected to dB t (vj) equal 1 with probability at least e Bt /[e Bt + (q— 1)]. It thus follows from 
(30) that ||tX7_(- \ UA st ) ~ I ZZa s *) Htv < o(£) uniformly over all Q Ast - But under w°_ the 
cr^ are exactly independent, and since s = s(t) — > oo the marginal laws of the cr^ resulting 
from the Potts-FK coupling belong to A[o(£)], proving the result. □ 
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